The Data Warehouse ETL Toolkit by Ralph Kimball & Joe Caserta

The Data Warehouse ETL Toolkit by Ralph Kimball & Joe Caserta

Author:Ralph Kimball & Joe Caserta [Kimball, Ralph & Caserta, Joe]
Language: eng
Format: epub
ISBN: 9781118079683
Publisher: Wiley
Published: 2011-04-21T05:00:00+00:00


Physically Deleting Facts

Physically deleting facts means data is permanently removed from the data warehouse. When you have a requirement to physically delete records, make sure the user completely understands that the data will never be able to be retrieved once it is deleted.

Users often carry a misconception that once data enters the data warehouse, it is there forever. So when users say they will never have a reason to see deleted data, never and see need to be clarified. Make sure they say exactly what they mean and mean what they say.

Never. It is quite common for users to think in terms of today’s requirements because it is based on their current way of thinking about the data they use. Users who’ve never been exposed to a data warehouse may not be used to having certain types of history available to them. There’s an old aphorism: You can’t miss what you’ve never had. In most cases, when a user says never, he or she means rarely. Make sure your users are well aware that physical deletion is a permanent removal of the record.

See. When a user says see, most likely he or she is referring to the appearance of data in reports. It’s quite common that users have no idea what exists in raw data. All data is usually delivered through some sort of delivery mechanism such as business-intelligence tools or reports that may be automatically filtering unwanted data. It’s best to check with the team responsible for data presentation to confirm such requirements. If no such team exists, make sure your users are well aware that physical deletion is a permanent removal of the record in the data warehouse.



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Popular ebooks
In-Memory Analytics with Apache Arrow by Matthew Topol(2705)
Data Forecasting and Segmentation Using Microsoft Excel by Fernando Roque(2702)
PostgreSQL 14 Administration Cookbook by Simon Riggs(2228)
Cloud Auditing Best Practices: Perform Security and IT Audits across AWS, Azure, and GCP by building effective cloud auditing plans by Shinesa Cambric Michael Ratemo(1840)
Architects of Intelligence_The Truth About AI From the People Building It by Martin Ford(1249)
In-Memory Analytics with Apache Arrow: Perform fast and efficient data analytics on both flat and hierarchical structured data by Matthew Topol(1048)
Mastering Azure Virtual Desktop: The Ultimate Guide to the Implementation and Management of Azure Virtual Desktop by Ryan Mangan(1031)
Automated Machine Learning in Action by Qingquan Song Haifeng Jin Xia Hu(916)
Python GUI Programming with Tkinter, 2nd edition by Alan D. Moore(883)
Ansible for Real-Life Automation - A complete Ansible handbook filled with practical IT automation use cases (2022) by Packt(754)
Learn Wireshark - A definitive guide to expertly analyzing protocols and troubleshooting networks using Wireshark - 2nd Edition (2022) by Packt(754)
Data Engineering with Scala and Spark by Eric Tome Rupam Bhattacharjee David Radford(434)
Introduction to Algorithms, Fourth Edition by unknow(390)
ABAP Development for SAP HANA by Unknown(369)
Automated Machine Learning in Action by Qingquan Song & Haifeng Jin & Xia Hu(315)
Kubernetes Secrets Handbook by Emmanouil Gkatziouras | 
Rom Adams
 | Chen Xi(297)
The AWK Programming Language by Aho Alfred V. Kernighan Brian W. Weinberger Peter J. & Brian W. Kernighan & Peter J. Weinberger(291)
Asynchronous Programming in Rust by Carl Fredrik Samson;(275)
Learn Enough Developer Tools to Be Dangerous: Git Version Control, Command Line, and Text Editors Essentials by Michael Hartl(266)
Machine Learning for Imbalanced Data by Kumar Abhishek Dr. Mounir Abdelaziz(261)